Variance-Based Exploration for Learning Model Predictive Control
نویسندگان
چکیده
The combination of model predictive control (MPC) and learning methods has been gaining increasing attention as a tool to systems that may be difficult model. Using MPC function approximator in reinforcement (RL) is one approach reduce the reliance on accurate models. RL dependent exploration learn, currently, simple heuristics based random perturbations are most common. This paper considers variance-based geared towards using approximator. We propose use non-probabilistic measure uncertainty value value-based methods. Uncertainty measured by variance estimate inverse distance weighting (IDW). IDW framework computationally cheap evaluate therefore well-suited an online setting, already sampled state transitions rewards. gradient then used perturb policy parameters direction where increasing. proposed method verified two simulation examples, considering both linear nonlinear system dynamics, compared standard perturbations.
منابع مشابه
Learning-based Model Predictive Control for Safe Exploration and Reinforcement Learning
Learning-based methods have been successful in solving complex control tasks without significant prior knowledge about the system. However, these methods typically do not provide any safety guarantees, which prevents their use in safety-critical, real-world applications. In this paper, we present a learning-based model predictive control scheme that provides provable high-probability safety gua...
متن کاملLinear model predictive safety certification for learning-based control
While it has been repeatedly shown that learningbased controllers can provide superior performance, they often lack of safety guarantees. This paper aims at addressing this problem by introducing a model predictive safety certification (MPSC) scheme for polytopic linear systems with additive disturbances. The scheme verifies safety of a proposed learningbased input and modifies it as little as ...
متن کاملLearning-based model predictive control for Markov decision processes
We propose the use of Model Predictive Control (MPC) for controlling systems described by Markov decision processes. First, we consider a straightforward MPC algorithm for Markov decision processes. Then, we propose value functions, a means to deal with issues arising in conventional MPC, e.g., computational requirements and sub-optimality of actions. We use reinforcement learning to let an MPC...
متن کاملLearning Model Predictive Control for Iterative Tasks
A Learning Model Predictive Controller (LMPC) for iterative tasks is presented. The controller is referencefree and is able to improve its performance by learning from previous iterations. A safe set and a terminal cost function are used in order to guarantee recursive feasibility and nonincreasing performance at each iteration. The paper presents the control design approach, and shows how to r...
متن کاملModel-Based Predictive Control
MBPC is a feedback-control methodology suitable to enforce efficiently hard constraints on the variables of the controlled system. It is shown that the method hinges upon a constrained open-loop optimal control problem along with the adoption of the so-called receding-horizon control strategy. In the important case of time-invariant linear saturated ANCBI systems, MBPC algorithms can be devised...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2023
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2023.3282842